Economically organized hierarchies in WordNet and the Oxford English Dictionary
نویسنده
چکیده
Good definitions consist of words that are more basic than the defined word. There are, however, many ways of satisfying this desideratum. For example, at one extreme, there could be a small set of atomic words that are used to define all other words; i.e., there would be just two hierarchical levels. Alternatively, there could be many hierarchical levels, where a small set of atomic words is used to define a larger set of words, and these are, in turn, used to define the next hierarchically higher set of words, and so on to the top-level of very specific, complex words. Importantly, some possible organizations are more economical than others in the amount of space required to record all the definitions. Here I ask, How economical are dictionaries? I present a simple model for an optimal set of definitions, predicting on the order of seven hierarchical levels. I test the model via measurements from WordNet and the Oxford English Dictionary, and find that the organization of each possesses the signature features expected for an economical dictionary. 2008 Elsevier B.V. All rights reserved.
منابع مشابه
Meaningful Clustering of Senses Helps Boost Word Sense Disambiguation Performance
Fine-grained sense distinctions are one of the major obstacles to successful Word Sense Disambiguation. In this paper, we present a method for reducing the granularity of the WordNet sense inventory based on the mapping to a manually crafted dictionary encoding sense hierarchies, namely the Oxford Dictionary of English. We assess the quality of the mapping and the induced clustering, and evalua...
متن کاملAutomatic Construction of Persian ICT WordNet using Princeton WordNet
WordNet is a large lexical database of English language, in which, nouns, verbs, adjectives, and adverbs are grouped into sets of cognitive synonyms (synsets). Each synset expresses a distinct concept. Synsets are interlinked by both semantic and lexical relations. WordNet is essentially used for word sense disambiguation, information retrieval, and text translation. In this paper, we propose s...
متن کاملMapping Multilingual Hierarchies Using Relaxation Labeling
This paper explores the automatic construction of a multilingual Lexical Knowledge Base from pre-existing lexical resources. We present a new and robust approach for linking already existing lexical/semantic hierarchies. We used a constraint satisfaction algorithm (relaxation labeling) to select -among all the candidate translations proposed by a bilingual dictionarythe right English WordNet sy...
متن کاملBabelNet meets Lexicography: the case of an automatically-built multilingual encyclopedic dictionary
In this paper we provide a first study of the lexicographic quality of BabelNet, a very large automatically-created multilingual encyclopedic dictionary. BabelNet 2.0, available online at http://babelnet. org, covers 50 languages and provides both lexicographic and encyclopedic knowledge for all the open-class parts of speech. It is obtained from the automatic integration of several language re...
متن کاملReducing the Granularity of a Computational Lexicon via an Automatic Mapping to a Coarse-Grained Sense Inventory
WordNet is the reference sense inventory of most of the current Word Sense Disambiguation systems. Unfortunately, it encodes too fine-grained distinctions, making it difficult even for humans to solve the ambiguity of words in context. In this paper, we present a method for reducing the granularity of the WordNet sense inventory based on the mapping to a manually crafted dictionary encoding sen...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Cognitive Systems Research
دوره 9 شماره
صفحات -
تاریخ انتشار 2008